xrrr t ■ 



' ' L h 

oAii' V Jc.' ^-nn05 



NAVAL POSTGRADUATE SCHOOL 

Monterey, California 




THESIS 



PREDICTING COAST GUARD ENLISTED ATTRITION 

by 

Douglas Allen Blakemore 
March 1992 



Thesis Advisor: 



Robert R. Read 



Approved for public release; distribution is unlimited 



T257707 



Unclassified 

SECURITY CLASSIFICATION OF THIS PAGE 



REPORT DOCUMENTATION PAGE 



Form Approved 
OMB No. 0704-0188 



la. REPORT SECURITY CLASSIFICATION 

UNCLASSIFIED 



lb. RESTRICTIVE MARKINGS 



2a. SECURITY CLASSIFICATION AUTHORITY 



2b. DECLASSIFICATION/DOWNGRADING SCHEDULE 



3. DISTRIBUTION/AVAILABILITY OF REPORT 

Approved for public release; distribution is unlimited. 



4 . PERFORMING ORGANIZATION REPORT NUMBER(S) 



5. MONITORING ORGANIZATION REPORT NUMBER(S) 



6a. NAME OF PERFORMING ORGANIZATION 

Naval Postgraduate School 



6b. OFFICE SYMBOL 

OR 



7a. NAME OF MONITORING ORGANIZATION 



6c. ADDRESS (City, State, and ZIP Code) 
Monterey, CA 93943-5000 



7b. ADDRESS (City, State, and ZIP Code) 



8a. NAME OF FUNDING/SPONSORING 
ORGANIZATION 



8b. OFFICE SYMBOL 



9. PROCUREMENT INSTRUMENT IDENTIFICATION NUMBER 



8c. ADDRESS (City, State, and ZIP Code) 



10. SOURCE OF FUNDING NUMBERS 



PROGRAM 


PROJECT 


TASK 


WORK UNIT 


ELEMENT NO. 


NO. 


NO. 


ACCESSION NO. 



1 1 . TITLE (Including Security Classification) 

PREDICTING COAST GUARD ENLISTED ATTRITION 



12 PERSONAL AUTHOR(S) 

BLAKEMORE, Douglas Allen 



13 TYPE OF REPORT 

Master’s thesis 



13b. TIME COVERED 
FROM TO 



14. DATE OF REPORT (Year, Month, Day) 
1992, March 



1 5. Page Count 
59 



16. SUPPLEMENTAL NOTATION 

The views expressed in this thesis are those of the author and do not reflect the official policy or position of the 
Department of Defense or the U.S. Government. 



17. COSATI CODES 


FIELD 


GROUP 


SUB-GROUP 















18. SUBJECT TERMS (Continue on reverse if necessary and identify by block number) 
Attrition, survival analysis, Weibull regression model, Binomial counting model, 
survivor function, accelerated failure time model, Coast Guard Attrition 



19. ABSTRACT (Continue on reverse if necessary and Identify by block number) 

This study examines enlisted attrition behavior for the U.S. Coast Guard and develops a model that projects attrition 
figures. Survival analysis techniques are used to analyze the empirical attrition behaviors associated with an 
Individuals sex, race, marital status, and military occupational skill (MOS). In this study males tend to have higher 
survival probabilities than females, non-caucasians higher than Caucasians, and married persons higher than those 
not married. Aviation MOSs have the highest survival probabilities and technical MOSs have the lowest. 

Modelled survivor functions are developed for two paygrades because each contain small personnel inventories. 
These modelled survivor functions do not fit the data as well as desired but are nonetheless used pending the 
development of sharper alternatives. 

Finally, a counting model based on the Binomial Distribution is developed that projects monthly enlisted attrition 
figures. 



20 DISTRIBUTION/AVAILABILTIY OF ABSTRACT 
|X] UNCLASSIFIED/UNLIMITED □ SAME AS RPT.Q DTIC 



la. REPORT SECURITY CLASSIFICATION 



22a. NAME OF RESPONSIBLE INDIVIDUAL 

Robert R. Read 



22b. TELEPHONE (Include Area Code) 
(408)646-2382 



22c. OFFICE SYMBOL 

OR/Re 



DD Form 1473, JUN 86 



Previous editions are obsolete. 
S/N 0 1 02-LF-0 1 4-6603 



SECURITY CLASSIFICATION OF THIS PAG E 

Unclassified 



Approved for public release; distribution is unlimited. 



Predicting Coast Guard Enlisted Attrition 



by 



Douglas Allen piakemore 
Lieutenant, United States Coast Guard 
B . S . , U . S . Coast Guard Academy 



Submitted in partial fulfillment 
of the requirements for the degree of 

MASTER OF SCIENCE IN OPERATIONS ANALYSIS 



from the 



NAVAL POSTGRADUATE SCHOOL 
March 1992 



ABSTRACT 



This study examines enlisted attrition behavior for the 
U.S. Coast Guard and develops a model that projects attrition 
figures. Survival analysis techniques are used to analyze the 
empirical attrition behaviors associated with an individuals 
sex, race, marital status, and military occupational skill 
(MOS) . In this study males tend to have higher survival 
probabilities than females, non- Caucasians higher than 
Caucasians, and married persons higher than those not married. 
Aviation MOSs have the highest survival probabilities and 
technical MOSs have the lowest. 

Modelled survivor functions are developed for two 
paygrades because each contain small personnel inventories. 
These modelled survivor functions do not fit the data as well 
as desired but are nonetheless used pending the development of 
sharper alternatives. 

Finally a counting model based on the Binomial 
Distribution is developed that projects monthly enlisted 
attrition figures. 
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I . INTRODUCTION 



A. PROBLEM STATEMENT 

The United States Coast Guard (CG) is a military service 
operating within the Department of Transportation. Its main 
missions are aids to navigation, boating safety, defense 
operations, maritime law enforcement, environmental response, 
marine inspection, ice operations, marine licensing, marine 
science, search and rescue, port security and safety, and 
waterways management. The CG maintains an active duty 
hierarchical personnel structure with approximately 6,000 
officers and 30,000 enlisted members. The CG Personnel 
Workforce Planning (PWP) office located in Washington, D.C. 
has the overall responsibility for projecting the active duty 
enlisted personnel needs of the CG. Their responsibilities 
include the calculating of personnel stocks, promotion 
requirements, attritions, and recruitment needs. The current 
method for predicting monthly enlisted attrition by paygrade 
is to calculate a running average or mean of past monthly 
attrition and use this mean as a forecast of future attrition. 
This forecasting method is crude, ignores many issues, and is 
not accurate enough to meet the needs of the CG. 
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Why does a person separate from the CG? What are the 
characteristics associated with this person? Are these 
characteristics the same for each paygrade? Previous CG 
studies have focussed on the first question but have not 
looked at attrition data analytically. The first question 
will not be addressed. This thesis will look at the next two 
questions and will propose a method, based on a survival 
analysis of CG enlisted attrition data, for calculating 
monthly personnel attrition. This study is a first attempt at 
examining CG enlisted attrition using data analysis tools. It 
is designed to provide a procedure for studying attrition, and 
for developing a basic model that calculates monthly attrition 
which can be expanded by future studies. 

B. COAST GUARD ENLISTED PERSONNEL BACKGROUND INFORMATION 

The Coast Guard enlisted personnel structure is composed 
of 9 paygrades, E-l to E-9, lowest to highest respectively. 
(There is an E-10 position filled by one person but this 
thesis does not address this paygrade.) Paygrade E-l is 
composed of persons attending CG basic training school and 
paygrade E-2 is composed of persons who have completed basic 
training school, have been assigned to active duty commands, 
and are preparing to attend a CG specialty school (also called 
A- school) . Paygrades E-3 through E-9 are divided into 24 
active duty military occupational skills (MOS) or sub- 
specialties listed below: 
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Aviation Machinist's Mate (AD) 

Aviation Electrician's Mate (AE) 

Aviation Structural Mechanic (AM) 

Aviation Survivalman (ASM) 

Aviation Electronic Technician (AT) 

Boatswain's Mate (BM) 

Damage Controlman (DC) 

Electrician's Mate (EM) 

Electronics Technician (ET) 

Fire Control Technician (FT) 

Gunner's Mate (GM) 

Health Services Technician (HS) 

Machinery Technician (MK) 

Musician (MU) 

Marine Science Technician (MST) 

Public Affairs Specialist (PA) 

Quartermaster (QM) 

Radarman (RD) 

Radioman (RM) 

Storekeeper (SK) 

Subsistence Specialist (SS) 

Sonar Technician (ST) 

Telephone Technician (TT) 

Yeoman ( YN) 
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An enlisted man (EM) receives an MOS after either 
completing a specialty school or by completing an intensive 
on-the-job training program. Both methods are usually 
completed while the EM is in the E-3 paygrade. An E-3 who has 
not obtained an MOS is called nonrated; E-3's through E-9's 
hold only one MOS at a time and usually keep that MOS 
throughout their careers. 

Promotions to the next highest paygrade are performed by 
MOS and are dictated by the needs of the CG. Promotions to 
paygrades E-5 through E-9 are vacancy driven and promotions to 
paygrades E-3 and E-4 occur on qualification for advancement. 
Promotion to paygrade E-2 occurs upon completion of basic 
training. 

Enlisted personnel sign service obligation contracts 
(enlistments) that require the person to serve in the CG for 
a set number of years. Enlistments usually last four years. 
(There have been two year enlistments but these have been 
terminated.) Upon completion of an enlistment and upon 
approval by the CG, a person may sign a new contract 
(reenlist) , or separate from the CG. Enlisted attrition 
generally occurs because of: 

retirement - minimum 20 years active duty service, 

non- reenlistments - an EM chooses not to reenlist or the 

CG chooses not to reenlist the person, 
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administrative reasons - an EM may depart the CG prior to 
the end of his/her enlistment "for the convenience of the 
government " , 

death or disability, 

selection to an officer program. 

For the purposes of this study the term separation refers to 
a person leaving the CG for any of the above reasons. 

C . PROCEDURE 

This thesis examines, by paygrade, individual personnel 
characteristics associated with attrition and develops a model 
that calculates monthly attrition. Survival analysis 
techniques are used to study behaviors associated with these 
attrition characteristics. Survival analysis is a data 
analysis technique that is useful in examining failure time 
data. Many of the visible applications have been in medical 
research where a typical "failure" is the death of a patient 
who had been subjected to some type of treatment. Obviously 
it would not be practical to wait for all the patients to 
"fail" before analyzing the data. It is more convenient to 
end the experiment at some specified time and do an analysis 
of all the data. The observables that did not "fail" by the 
end of the data collection period are called censored 
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observations. They must be treated differently since their 
failure times are yet to occur and are hence unknown. 
Survival analysis allows for censored failure times and 
provides techniques for including them in the analysis. The 
primary tool used in the survival analysis will be the 
estimation of survivor functions or curves. These curves 
provide probabilities that a person survives up to and beyond 
a set time, t. Their estimators are often called product 
limit estimators because of the technique used. Survival 
probabilities are the compliment of attrition rates and are 
needed to calculate expected monthly attrition figures. 

The personnel data are cross classified by paygrade and 
each paygrade is partitioned by certain individual personnel 
characteristics. Paygrades E-l and E-2 will be partitioned by 
the characteristics of sex, marital status, and race. 
Paygrade E-3 will be partitioned by these same characteristics 
and by whether the person had obtained an MOS or was nonrated. 
Paygrades E-4 through E-9 will be partitioned by the same 
characteristics as E-ls and E-2s and by CG MOS. MOSs will be 
grouped according to categories listed in Table 1 on 
page 7. 
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TABLE 1. MOS GROUPINGS 



Group 


Rates 


Aviation 


AD, AE, AM, ASM, AT 


II Administrative 


YN, SK 


Technical 


EM, ET, FT, MST, ST, TT 


Skilled 


HS, SS, MU, PA 


CG Professional 


BM, DC, GM, MK, QM, RD, RM 



MOSs are grouped for two reasons. MOSs within the same 
group have similarities that should cause them to exhibit 
common attrition behavior and hence have common survival 
probability values. The MOSs within each category all have 
several common attributes; specifically, the type of training 
needed to acquire the MOS, the type of duty or jobs associated 
with those MOSs, and the possible civilian employment 
opportunities available for persons with these MOSs. MOSs are 
also grouped to increase the personnel inventory within each 
paygrade required for a survival analysis. Chapter II 
contains an explanation on these MOS groups and examines the 
data used in this study. 

For most MOSs stocks are large and empirical survival 
probabilities can be used. Modelled survival probabilities 
are needed if personnel stocks are smaller than preferred. 
Small personnel stocks are subject to large data variability; 
models are developed to provide stability. Paygrades E-8 and 



7 



E-9 could be considered too small to use empirical survival 
probabilities, thus regression models are examined in Chapter 
IV for these paygrades. 

A model that projects an expected number (and variance) of 
monthly attritions will be developed and demonstrated in 
Chapter V. This model is based on principles of the Binomial 
distribution. Paygrades will be partitioned several times to 
produce multiple binomial experiments from which expected 
values and variances can be calculated. The product limit 
estimates generated by the survival analysis will correspond 
to the probabilities of success associated with a binomial 
experiment . 

Finally conclusions and recommendations are contained in 
Chapter VI. Since this is the first time in recent years that 
enlisted attrition has been viewed using operations analysis 
tools, most of the conclusions should be unique. It is 
expected that additional studies utilizing more refined data 
will be needed to better understand, analyze, and predict CG 
enlisted attrition. 
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II. DATA OVERVIEW 



A. POPULATION 

Data for this thesis were provided by the CG Pay and 
Personnel Center located in Topeka, Kansas. The data were 
provided in two sets. The first set contained individual 
personnel records for all persons who separated during FY 91 
(1 November 1990 - 31 October 1991) . The second set contained 
individual personnel records for all enlisted persons who were 
on active duty the last day of FY 91 (31 October 1991) . This 
second data set is in a sense a "snapshot" of the GC EM 
composition on 31 October 1991. In both data sets each 

personnel record contained the persons sex, MOS (if 
applicable) , rank, marital status, ethnic origin, date of 
current enlistment, and date of separation (if applicable) . 
Each persons length of service (in months) can be calculated 
from the data entries date of current enlistment and date of 
separation. It should be noted that the entry, date of 
current enlistment, did not necessarily correspond to when the 
person first joined the CG. This affects the calculation of 
a persons time in service. In certain cases the CG allows an 
EM to separate from the CG and then allows the person to 
rejoin the CG in the future at his or her past paygrade. Such 
persons' past service time is not ignored. The person 
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maintains all accumulated service time for pay and retirement 
purposes. This condition was not reflected in the data 
provided for this study. There are numerous records of 
senior EMs having relatively little time in service when in 
all likelihood they have quite a bit of service time. This 
situation is most prominent in the survival analysis results 
of Chapter III. Appendix A contains procedures and Fortran 
programs needed to access and use the data. 

B. LIMITATIONS 

Analyzing only one years amount of personnel data presents 
two problems. The first and foremost is that the data lack 
temporal stability. More than one years worth of data is 
required to provide stability. Attrition study conclusions 
and modelling results based on unstable data are not as 
reliable as if they were based on time tested data. Secondly, 
the data may not have been as reliable as preferred because 
data inputs could not be verified. The FY 91 data were placed 
in storage by the Coast Guard PPC and were not accessible 
during the study. Recommendations contained in Chapter VI 
address these shortcomings further. 

C . MOS GROUPS 

As stated in Chapter I MOS groups were formed because of 
common attributes associated within each group. These 
attributes are listed in this section. 
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1. Aviation 



Individuals with aviation MOSs have common A- school 
characteristics and common duty stations. (The CG has three 
basic types of duty stations: ashore, afloat, and air 
stations.) The aviation A-schools (specialty school) waiting 
lists are relatively long compared to the other MOS groups. 
This lengthens a persons service time in paygrade E-3, thus 
increasing the amount of time needed for promotion to E-4. 
Unlike many MOSs, individuals within this group are limited in 
their types of duty stations. They spend most of their entire 
career at CG air stations. 

2 . Administrative 

These MOSs are again grouped because of their A- school 
characteristics, choice of duty locations, and possible 
civilian employment opportunities. The administrative A- 
schools have short waiting lists and short school durations. 
Promotion to E-4 in this MOS group is relatively fast compared 
to other groups. Administrative persons also can serve in 
almost any CG duty station or command. Since these persons 
skills are mainly clerical in nature it is felt that their 
civilian employment opportunities are limited. 

3 . Technical 

Persons with technical MOSs share the same type of 
characteristics as the administrative MOS group, but with two 
opposite attributes. Their A-schools have long waiting lists, 
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are long in duration, and have low graduation rates (as 
compared to the other MOS group A- schools) . Like the 
administrative group, persons in this group can serve in 
almost any CG command. However, unlike the above group, 
technically grouped persons probably have more civilian job 
opportunities . 

4. Skilled 

This group is characterized by their scope of possible 
civilian employment opportunities. Persons within this group 
probably have marketable skills, possibly not as marketable as 
the technical group but more marketable than any other rate 
group . 

5. CG Professional 

This group is characterized by MOS obtainment and 
employment opportunities. As discussed in Chapter I, MOSs are 
obtained by graduating from an A- school or by completing an 
on- the job training program. This group is the only MOS group 
that incorporates on-the-job training programs for a number of 
their MOSs. MOSs are also designed for CG specific functions 
(ie. shipboard machinists, CG weapons technicians. Shipboard 
navigators, etc.). These skills have limited value for 
civilian employment. 
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D. SUMMARY OP PERSONNEL STOCKS 



Table 2 provides a summary of personnel stocks 
partitioned by the rank, sex, race, and marital status for all 
CG EMs who were on active duty as of 31 October 1991. 



TABLE 2. ACTIVE DUTY PERSONNEL STOCKS FY 91 



Pay- 

grade 


Total 


Male 


Female 


Cauc . 


Non- 

Cauc. 


Mar. 


Not 

Mar. 


E-l 


732 


622 


110 


595 


137 


68 


664 


E-2 


3383 


3027 


356 


2776 


607 


622 


2761 


E-3 


3651 


3358 


293 


3031 


620 


1256 


2395 


E-4 


7064 


6364 


700 


6043 


1021 


3641 


3423 


E-C 


5904 


5410 


494 


4830 


1074 


4278 


1626 


E-6 


5748 


5427 


321 


4878 


870 


4805 


943 


E-7 


2764 


2696 


68 


2436 


328 


2436 


328 


E-8 


569 


564 


5 


501 


68 


517 


52 


E-9 


289 


289 


0 


256 


33 


270 


19 


Total 


30,104 


27,757 


2347 


25,346 


4758 


17,893 


12,211 



Note that race is divided into two categories: Caucasian 
and non- Caucasian. There are relatively few (as compared to 
the total EM population) minorities in the CG. All minorities 
were grouped to increase the personnel stocks within the race 
category. As can be seen the CG is a male, Caucasian 
dominated service. Table 3 on page 14 provides the same 
statistics for Coast Guardsmen who separated during FY 91. 
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TABLE 3. PERSONNEL STOCKS FOR PERSONS SEPARATING DURING FY 91 



Pay- 

grade 


Total 


Male 


Female 


Cauc. 


Non 
Cauc . 


Married 


Not 

Married 


E-l 


779 


637 


142 


601 


178 


83 


696 


E-2 


522 


438 


84 


429 


93 


115 


407 


E-3 


831 


731 


169 


712 


119 


276 


555 


E-4 


1248 


1094 


154 


1079 


169 


602 


696 


E-5 


438 


398 


40 


402 


36 


281 


157 


E-6 


385 


369 


16 


348 


37 


317 


68 


E-7 


315 


305 


10 


271 


44 


277 


38 


E-8 


79 


77 


2 


73 


6 


73 


6 


E-9 


46 


46 


0 


45 


1 


39 


7 


Total 


4643 


4095 


548 


3960 


683 


2063 


2850 



Approximately the same proportions of males to females, 
Caucasians to non- Caucasians, and married persons to persons 
not married separated from the CG as did those who did not 
separate during FY 91. 

Table 4 on page 15 provides on a summary of personnel 
stocks for FY 91 active duty personnel by MOS group. 
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TABLE 4. FY 91 ACTIVE DUTY PERSONNEL STOCKS BY MOS GROUP 



Paygrade 


Aviation 


Admin. 


Technical 


Skilled 


CG Pro. 


E-3 


20 


41 


84 


71 


333 ! 


E-6 


1142 


970 


1031 


715 


3206 


E-5 


921 


949 


781 


490 


2763 


E-6 


748 


899 


837 


527 


2737 


E-7 


262 


353 


452 


260 


1437 


E-8 


77 


88 


98 


55 


251 


E-9 


43 


40 


38 


18 


150 


Total 


3213 


3340 


3321 


2136 


10,877 



The numbers in paygrade E-3 are small because not all E-3s 
have an MOS. Overall approximately 48% of all personnel 
belong to the CG professional MOS group. About 14% belong in 
the aviation group; 15% belong in the administrative group; 
14% belong in the technical group; and 9% belong in the 
skilled rate group. Percentages within each paygrade are 
roughly the same. Table 5 on page 16 provides the same 
statistics for personnel who separated during FY 91. 
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TABLE 5. FY 91 SEPARATION PERSONNEL STOCKS BY MOS GROUP 



Paygrade 


Aviation 


Admin. 


Technical 


Skilled 


CG Pro. 


E-3 


1 


5 


12 


23 


65 


E-4 


75 


157 


180 


100 


733 


E-5 


26 


46 


51 


26 


246 


E-6 


31 


46 


51 


36 


221 


E-7 


22 


51 


23 


26 


193 


E-8 


11 


15 


14 


8 


31 


E- 9 


3 


6 


3 


3 


31 


Total 


169 


328 


373 


221 


1518 



E . DATA PARTITIONING 

The data was partitioned by rank to form 9 paygrades. 
Each paygrade was further divided by the personnel 
characteristics sex, race, marital status, and MOS groups 
listed in Table 1 on page 7. 

1. Paygrades E-l and E-2 

These paygrades were divided by the sex, race, and the 
marital status of the individual. Marital status indicated 
whether or not a person was currently married. 

2 . Paygrade E - 3 

In addition to the above, this paygrade was 
partitioned by the obtainment of an MOS. The person either 
had obtained one of the MOSs listed in Chapter I or had not 
(nonrated) . 
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3. Paygrades E-4 through E-6 

These paygrades were also partitioned by sex, race, 
and marital status. Additionally they were partitioned by MOS 
groups . 

4. Paygrades E-7 through E-9 

These paygrades were divided by race, marital status, 
and MOS group. Sex was not examined because of the scarcity 
of females in these paygrades. 
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III. SURVIVAL ANALYSIS 



A. INTRODUCTION 

As stated in Chapter I, survival analysis provides a 
method for analyzing censored failure time data. Failure 
times in this study refer to the number of months a person 
serves on active duty prior to separation. Failure times are 
assumed to be discrete and are rounded up to the next whole 
month figure. For example, if a person separates after 31 
months, six days of service his or her failure time is 32 
months. This chapter reviews the theory of survival analysis 
and graphically compares survivor functions of individual 
paygrades that have been partitioned by single personnel 
characteristics. Appendix C contains the SAS LIFETEST 
procedure and Fortran code required to estimate the survivor 
functions . 

B. COMPUTATION OF SURVIVOR FUNCTIONS 

Let T be a nonnegative random variable representing the 
separation time of an EM from a homogeneous population. The 
survivor function is defined to be the probability that T is 
at least as great as some time, t, 
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S ( t) -Pz(T^t) ; 0 <; t < ~ . 



(3.1) 



From a simple random sample, this can be estimated by 



Sit) =l-( 



number of_ qbservation£ with failur_e time £ t 
total number of observations 



) . (3.2) 



If T is a discrete random variable taking values x t < Xj ... < 
Xj, with associated probability function f (x;) = Pr(T=Xj), then 



where H is the Heaviside function, H(x)={0 if x<0; 1 if xaO} 
[Ref. 1 : pp . 5-7] . 

The presence of censored data, however, weakens the 
estimation of the survivor function in the right tail. All 
that can be safely assumed about an observation censored at 
time t is that the unobserved failure time is greater than t. 
This condition is relieved by using an alternate formulation 
of the survivor function using hazard rates. For discrete 
random variables, the hazard rate, X,, is the probability that 
the observation will immediately fail at time t given is has 



j 



Sit) =£ fixjHiXt-t) , 



(3.3) 
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survived to time t. In the discrete case this can be 
estimated by 

^ number of observations that fail at_t ^ ^ 

c number of observations with failure time > t’ 

Using hazard rates [Ref. 3], the survivor function may be 
expressed as 



Sit) = JI (1-X i ) . (3.5) 

iix^t 

where \ is the hazard rate at Xj for i=l,...,j. Estimating \ } 
would allow an estimation of the survivor function, S(t). 
Chapter 2 of Reference 1 contains a detailed explanation of 
the computation of the estimated survivor function. 

C. SURVIVAL ANALYSIS RESULTS 

In all paygrades the marital status characteristic and sex 
characteristic behaviors were identical; married persons 
appeared to have a higher probability of survival than those 
who were not married, and males tended to have higher survival 
probabilities than did females. (See Figures 1 and 2 on page 
40.) Non- Caucasians in paygrades E-3 through E-9 had higher 
survival probabilities than did Caucasians. Results for other 
characteristics are given below. 
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1. E-l 



Caucasians and non- Caucasians have about the same 
survivor functions until about two months. After two months 
Caucasians tend to have higher survival probabilities. (See 
Figure 3 on page 41.) 

2. E-2 

Caucasians had higher survival probabilities than did 
non- Caucasians up to about 20 months. (See Figure 4 on page 
41.) After that non- Caucasians had higher survivor 
probabilities. Also note the significant drop in 
probabilities at 24 months. The CG for a brief time allowed 
two year enlistments. This drop corresponds to enlisted 
persons separating at the end of a 2 year enlistment. 

3. E-3 

Individuals possessing an MOS had higher survival 
probabilities than did nonrates after 22 months. (See Figure 
5 on page 42 . ) 

4. E-4 and E-5 

The aviation MOS group in both paygrades had the 
highest survival probabilities and the technical group had the 
lowest. The remaining three groups survival functions fell in 
between the aviation and technical survivor functions. (See 
Figure 6 on page 42 . ) 
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5. E-6 



All MOS groups in this paygrade were very similar as 
shown in Figure 6 on page 42. All had very high probabilities 
of survival and all were very similar in shape. Again the 
aviation group had the highest survival probabilities. 

6. E-7, E-8, and E-9 

The survivor functions in all three paygrades appeared 
fairly flat with slightly decreasing slopes until about 220 
months. (See Figure 7 on page 43.) After 220 months all 
survivor functions decreased at about the same rate. All 
survivor functions within these paygrades are characterized by 
high survival probabilities with the aviation group having the 
highest probabilities of survival. 
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IV. MODELLED SURVIVOR FUNCTIONS 



A. INTRODUCTION 

Regression models are typically used to model survival 
experiences from homogeneous populations [Ref. 1] . These 
models allow for failure times to be dependent on observed 
explanatory variables. These explanatory variables can be 
either qualitative or quantitative. Two common regression 
models are the exponential regression model and the weibull 
regression model. Weibull regression models will be 
constructed and analyzed for paygrades E-8 and E-9. The 
primary reference for this chapter is Chapter 2 and Chapter 4 
of Reference 1. 

B. WEIBULL REGRESSION MODEL 
1. Weibull Distribution 

The Weibull distribution is a generalization of the 
exponential distribution that allows for a power dependence, 
p, of the hazard of time. The Weibull distribution is in fact 
an exponential distribution when p=l. The probability density 
function (pdf) of the Weibull is 

f(t)=Apat) p - 1 e ( ' at) " ) for tzO . (4.1) 



The hazard function is 
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k (t) =kp(kt) p - 1 



(4.2) 



and the survivor function is 



S (t) =e <_(it)f>) . (4.3) 

A convenient check to verify that data fits the 
Weibull distribution is to plot the log [-log S { t ) ] versus log 
t(S(t) is the empirical estimate of the survivor function). 
If the data can be fitted by the Weibull distribution, the 
plot should provide a relatively straight line with the slope 
of the line roughly estimating the value of p. 

2. Log-Linear Regression Model 

The Weibull distribution can be generalized to obtain 
a Weibull regression model by allowing the failure rate to be 
a function of a vector of qualitative personnel 
characteristics, X. In this application X is a row vector 
consisting of the binary variables X 4Vi4tioo , X 4dmini4trative , X^^, X skiUed , 
and X cgprofeision4l which take on a value of 1 if the observation 
falls into the subscripted category, and are 0 otherwise. 

If the conditional (on X) hazard is 

\(t\X) =Xp(\ tK-'e * 8 
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then the conditional density of T is 



f(t \X) =Xp(Xt) p - 1 e w e-' (X ' ),eX ' 1 (4.4) 

where /3= (/3, , . . . , /S n ) is a vector of regression parameters. In 
terms of Y=log T, equation (4.4) is the linear model 

Y=oi + X(3'+ aW (4.5) 

where Q!=-log X, cr=p' 1 , /3*=-a/3, and W has the extreme value 

distribution. In equation (4.5) the personnel characteristics 
act additively on the time scale, vice multiplicatively in 
equation (4.4). (Ref. 1] 

3. Modelled Survivor Function 

Equation (4.5) is known as an accelerated failure time 
model. It is characterized by the scaling of the failure time 



T=e' w r A 



where T 0 is a failure time from a baseline when the values of 
X are all zero. Using this time scaling the modelled survivor 
function is 
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(4.6) . 



S(t) = e - (Xe ~ >,y 

In equation (4.6), S(t) can be estimated by inserting 
appropriate values for X and t, and estimated values of /?, X, 
and p. This will produce a modelled survivor function which 
can be plotted to produce curves like those in Chapter III. 
If this modelled survivor function is similar (in shape and 
magnitude) to the empirical survivor function, then the model 
can be used. If it is not similar, then it possibly should 
not be used. 

C. RESULTS 

The partitioned data sets were regressed using the SAS 
LIFEREG procedure [Ref. 4] to obtain estimates for /?, X, and 
p. These values along with appropriate values for X were 
inserted into equation (4.6) to estimate modelled survivor 
functions. Figures 8 through 13 on pages 43 through 46 
contain plots of these modelled functions against empirical 
survivor functions. Generally the modelled survivor functions 
do not fit the data as well as desired. 

Nonetheless, modelled survivor functions for these two 
paygrades will be used pending the development of sharper 
alternatives. The partitioned data sets are so small, it is 
felt that a model will produce the most reliable and stable 
probability estimates. 
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V. BINOMIAL COUNTING MODEL 



A. INTRODUCTION 

The Binomial Distribution provides a means for calculating 
expected values and variances of sums of random variables that 
have one of two outcomes: success or failure. This study is 
concerned with estimating the number of enlisted who separate 
on any given month (or conversely the number who do not 
separate) . This chapter develops a simple model, based on 
principals of the Binomial Distribution, that calculates 
monthly survival figures per paygrade, and provides an example 
of how the model works. The primary reference is Chapter 3 of 
Reference 5. 

B. BINOMIAL DISTRIBUTION 

A binomial experiment is an experiment that possesses the 
following properties: 

- the experiment consists of n identical trials, 

- each trial results in one of two outcomes, a success or 
failure , 

- the probability of success on a single trial is equal to 
p and remains the same from trial to trial. 
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- trials are independent, and 

- the random variable of interest is Y, the number of 
successes in n trials. The expected value of Y is 
E(Y)=np, where n is the number of trials and p is their 
corresponding probability of success. [Ref. 5] 

A person separating from the CG meets these standards and 
can be considered a Bernoulli trial. Each term trial results 
in one of two outcomes. A success is marked if the person 
does not separate, a failure when the person leaves the CG. 
The product -limit -estimate of the survivor function provides 
the probability of a person surviving to and beyond a time, t; 
P (Tat) for t=l, . . .00. This probability estimate remains the 
same for each trial (ie. for all persons with length of 
services equal to time t) . The decision by a person to 
separate (or the CGs decision to separate a person) is not 
affected by another persons choice and thus independent of any 
other persons decision. And finally, Y equals the number of 
persons not separating within a paygrade who have survived 
beyond a time t. Using these definitions a basic model can be 
constructed that generates an expected value and variance for 
the number of persons that do not separate on a given month. 
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C. COUNTING MODEL 



Paygrades are partitioned in two steps to form data cells 
that consist of independent and identical trials required of 
the Binomial Distribution. The first step separates each 
paygrade into the single personnel characteristic examined in 
the survival analysis. These will be called sets. The second 
step further partitions the paygrade by the amount of time 
each person has served, or Time in Service (TIS) cells. 
Figure 14 provides a tree diagram example of this partitioning 
scheme. 




Figure 14. Counting Model Partitioning Scheme 



T is the maximum TIS any one person has within each set. 
Paygrades E-l and E-2 are partitioned by race to form two data 
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sets each; paygrade E-3 is partitioned by MOS obtainment; and 
paygrades E-4 through E-9 are partitioned by MOS group. Each 
cell has a corresponding probability of success, pj, (either 
estimated from the actual data or generated by a model) which 
was defined in Section 4.D.1 to be the probability of survival 
being greater than or equal to t. Given n^, the expected 
number of persons who fall into cell j , the expected number of 
successes (persons that do not separate) in each cell is 

E(Yj)=npj ; j=l,...,T. (4.7) 



The cell variance is 



Var(Yj) 



(4.8) 



where qj=l-pj. The expected number of successes within each 
set is the sum of the cell expected values 



T 

E(Y sa ) =Y, E{Y j) (4.9) 

y-i 
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and by summing cell variances the set variance is 



T 

VariYj ^VariYj) 



}* i 



(4.10) 



The expected number of successes per paygrade is calculated by 
summing set expected values . Paygrade variance is the sum of 
all set variances. 

Appendices C and D contain procedures for generating 
product limit estimates which correspond to the p^s. 

D . EXAMPLE 

Data from paygrade E-l is used to illustrate this counting 
model. Paygrade E-l contains 2 paygrade sets. Set! contains 
records for Caucasians and Set 2 contains records for non- 
caucasians. Table 6 and Table 7 on page 32 contain frequency 
counts of the number of persons falling within each cell per 
set . 
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TABLE 6. PAYGRADE CELL FREQUENCY COUNT CAUCASIAN E-l 



| Month 


Freq 


Month 


Freq 


Month 


Freq 


Month 


Freq 


1 1 


53 


11 


2 


27 


1 


39 


3 


1 2 


339 


12 


4 


30 


2 


40 


2 


3 


148 


15 


3 


31 


1 


42 


. 


3 


8 


16 


2 


42 


1 


43 


1 


5 


2 


21 


4 


34 


1 


48 


1 


6 


1 


23 


3 


36 


1 


60 


1 


7 


1 


25 


2 


38 


1 


>60 


5 



TABLE 7. PAYGRADE CELL FREQUENCY COUNT NON -CAUCASIAN E-l 



Month 


Freq 


Month 


Freq 


Month 


Freq 


Month 


Freq 


1 


2 


4 


6 


16 


1 


52 


1 


2 


87 


13 


1 


18 


1 


>52 


0 


3 


35 


14 


l 


20 


1 




1 



Tables 8 and Table 9 on page 33 contain the estimated pj values 
for each data sets. 
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TABLE 8. PRODUCT LIMIT ESTIMATES PAYGRADE E-l CAUCASIAN 



Month 


P 


Month 


P 


Month 


P 


1 


.88 


15 


.36 


28 


.256 


t 


. 646 


16 


.345 


28 


.243 


3 


.5 


17 


.337 


29 


.23 


8 


.468 


16 


.33 


3$ 


.225 


5 


.441 


19 


.326 


31 


.22 


6 


.431 


20 


.318 


32 


.21 


t 


.424 


21 


.314 


33 


.2 


8 


.417 


22 


.294 


35 


.2 


11 


.417 


28 


.294 


36 


.1 


12 


.393 


24 


.286 


40 


.1 


13 


.889 


28 


.265 


64 


.05 


14 


.375 


26 


.256 


62 


0 



TABLE 9. PRODUCT LIMIT ESTIMATES PAYGRADE E-l NON- CAUCASIAN 



Month 


P 


Month 


P 


Month 


P 


8 


.889 


13 


.225 


37 


.107 


8 


.646 


14 


.213 


35 


.107 


3 


.444 


15 


.173 


40 


.089 


6 


.403 


16 


.173 


43 


.089 


6 


.368 


17 


.173 


40 


.071 


6 


.344 


16 


.159 


40 


.071 


7 


.344 


19 


.159 


40 


.054 


8 


.332 


20 


.143 


64 


.054 


10 


.332 


30 


. 143 


65 


.027 


11 


.249 


31 


.125 


66 


0 


12 


.249 


36 


.125 
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Substituting ftj the values from tables 6 and 7 for the nj 
values and Pj the estimated probability values from Tables 8 
and 9 into equation (4.9), the estimated expected value of 
each set is 



E(Y X ) 



60 



= E ^ =353 * 475 

j* i 



E(Y 2 ) 



52 



= E4P/ =76 * 905 

y* * 



where setj corresponds to Caucasians and set 2 corresponds to 
non- Caucasians . Using equation (4.10) the Var(Y t ) = 129.5 and 
Var(Y 2 ) = 30.97. The overall paygrade estimated expected 
value and variance is the sum of the set values 

2 

E(Y e _ ,) =J^E(Y,) =430.38 

f=l 



2 

Var{Y E . x ) =J^ Var(Yf) =160.47. 






For working purposes these values are rounded down. 
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VI. CONCLUSIONS AND RECOMMENDATIONS 



A. CONCLUSIONS 

There are approximately 30,100 enlisted persons in the CG 
at any given time. A method is needed to predict monthly 
attrition rates to maintain the CG enlisted hierarchical 
structure. Survival analysis was used to examine the 
characteristics of sex, marital status, race, and MOS group 
behavior of CG enlisted personnel. The results were fairly 
interesting. Males and married persons tended to have higher 
probabilities of survival than did females and individuals who 
were not married. Non- Caucasians in most of the paygrades had 
higher survival probabilities than did Caucasians. MOS groups 
demonstrated different characteristic behaviors among 
different paygrades. MOS groups in paygrades E-4 and E-5 
behaved about the same up to 48 months. After 48 months all 
groups showed a rapid decline in survivor probabilities with 
the aviation group having the highest probabilities and the 
technical group having the lowest. This drop is indicative of 
the end of a four year enlistment when individuals can usually 
separate from the CG. MOS groups in paygrades E-6 through E-9 
all had very high probabilities of survival and demonstrated 
a decline at certain months. E-6 showed this decline at about 
200 months and E-7 through E-9 demonstrated this at about 240 
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months. The drop in probabilities at 240 months is easily 
explained. Many Coast Guardsmen retire at 20 years and 
receive a substantial pension and many important benefits (ie. 
medical coverage for the person and his or her dependents) . 
The drop at 200 months in paygrade E-7 is an anomoly. An E-7 
at this point in his/her career has almost 17 years of service 
time and has only about three more years to serve before 
retirement. Separating at 17 years seems to be financially 
inappropriate because the person will not receive a pension 
nor benefits if they separate. 

Modelled survivor functions were developed for paygrades 
E-8 and E-9 because these paygrades possessed small population 
sizes. Despite the fact that the models produced generally 
poor fits, it was felt that these survivor functions would be 
more reliable than survivor functions generated from very 
small data sizes. Therefore models were used to calculate 
attrition in paygrades E-8 and E-9. 

The binomial counting model provides an effective tool for 
calculating monthly attrition figures. Expected values and 
variances can be calculated by partitioning each paygrade into 
several binomial experiments. This model is easy to use and 
can be expanded in future studies. 

B. RECOMMENDATIONS 

As a first study the goals of this thesis were met. 
However further studies are required to better analyze CG 
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enlisted attrition. The counting model could be adjusted or 
refined based on findings from the recommendations listed in 
this section. 

1. Recommendation 1 

The counting model needs to be verified. Personnel 
data from early months of FY 92 need to be processed through 
the model to verify that it provides better results than the 
current calculation method. 

2 . Recommendation 2 

Additional historical data needs to be analyzed to 
provide temporal stability to the study. The counting model 
produced in this study assumes that the characteristic 
behaviors of individuals identified in FY 91 will repeat 
themselves in FY 92. 

3 . Recommendation 3 

A more manageable data base needs to be identified 
that allows for accessible man-machine interface. An 
accessible data base allows the analyst to verify, manipulate, 
and easily examine data entries as an on-going task. This 
provides data reliability. The CG PPC is not responsible for 
maintaining this type of data base, thus PWP should examine 
this issue. 

4. Recommendation 4 

Individual MOSs within MOS groups need to be examined 
to ensure they demonstrate similar attrition behaviors. 
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5 . Recommendation 5 



Combined personnel characteristics need to be analyzed 
for survival behaviors and seasonal attrition trends need to 
be examined. 

6. Recommendation 6 

The data base needs to be extended in time. At 
present there is no justification for assuming time 
homogeneity of rates. Such an extension would also allow the 
study of some important exogenous variables, such as 
unemployment rate and inflation. 

7 . Recommendation 7 

The educational levels of Coast Guardsmen should be 
analyzed for their effect on attrition behaviors. High school 
graduates, alternate high school degree holders, and non-high 
school graduates need to be considered to determine if there 
are differences in attrition behaviors amongst these groups. 
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APPENDIX A. PROCEDURES FOR ACCESSING DATA 



1. Data is stored on two magnetic disks. To access disks: 

a. login to mainframe computer 

b. type "getmvs mvs215 (separation data tape) " or 
"getmvs mvs217" (active duty data tape) 

2. To run a program that has a huge output file, ie. a file 
that will not fit on an A-disk: 

a. type "tdisk 6" to get a temporary disk with 6 
cylinders 

b. switch A-disk with temporary disk ( say temp disk was 
assigned "D 130" 

1. type "access 130 A" 

2. type "access 191 D" 

c. copy program from the now D-disk to the newly created 
A-disk 

d. run program 

e. switch disks back when finished by reversing procedure 
in 2 .b 
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APPENDIX B. FIGURES 




Figure 1. Survivor Function Paygrade E-l by Marital Status 
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Figure 2. Survivor Function Paygrade E-4 by Sex 
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PAYGRADE E-l BY RACE 




Figure 3. Survivor Function Paygrade E-l by Race 
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Figure 4. Survivor Function Paygrade E-2 by Race 
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Figure 5. Survivor Function Paygrade E-3 by Rate Obtainment 
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Figure 6 . 



Paygrade s E-4, 



E-5, 



E-6 by Rate Group 
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SURVIVAL FUNCTIONS BY RATE GROUP 
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Figure 7 . 



Paygrades E-7, 



E-8, 



E-9 by Rate Groups 
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Figure 8. Model Fit Paygrade E-8 
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PAYGRADE E-8 




Figure 9. Model Fit Paygrade E-8 
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Figure 10. Model Fit Paygrade E-8 



44 



PAYGRADE E-9 




Figure 11. Model Fit Paygrade E-9 



PAYGRADE E-9 




Figure 12. Model Fit Paygrade E-9 
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APPENDIX C 



PROCEDURES FOR GENERATING SURVIVAL FUNCTIONS 



FILE; LIFETEST CODC2 Al 



TlTLE’E-2 »V RACE * /■ THIS SAS CODE TAKES A DATA SET AND GENERATES 
SURVIVOR FUNCTIONS FOR THE DATA. THIS CODE MAS USED FOR PAYGRADE E-2 
AND CREATES SURVIVAL FUNCTIONS FOR CAUCASIANS AND NON-CAUCAS I ANS . ■/ » 

DATA P02 I 

INPUT RATE RANK SEX AGE MAR RACE MONTHS CEN /-THIS LINE DEFINES THE 
VARIABLES WITHIN THE DATA SET. -/I 

GROUP -(RACE'S) /'THIS SEPARATES ALL CAUCASIANS FROM NON-CAUCASIANS. 

S INDICATES CAUCASIAN. THIS SPLITS THE DATA SET INTO 2 SET» I CONT1AWS 
CAUCASIANS. The OTHER CONTAINS ALL OTHER MINOR I T I TES . ■/ J 

CARDS /-TELLS SAS THAT THE DATA SET FOLLOWS. DATA SET HUST BE PLACED 
IHHEDUTELV AFTER THIS LINE AFTER THE SEMICOLON. AND MUST HAVE A 
SEMICOLON PLACED IMMEDIATELY AFTER THE LAST FILE.-/ I 



PROC LIFETEST PLOT-(S) KAXTIME-40 OUTSERV-POCEST /-THIS IS THE MAIN CODE 

it creates a survival function plot (S). and limits the x-axis of the 

PLOT TO -8 MONTHS. ALSO STGRE THE PRODUCT LIMIT ESTIMATES IN A SAS FILE 
CALLED PC2EST . ■ / ; 

TIME MONTMS-CENU) /-DEFINES T>€ TIME VARIABLE (HCNTHS). AND SETS THE 
CENSORING VARIABLE NITHIN T)€ DATA SET EQUAL TO 1 ■/ » 

STRATA GROUP /-PERFORMS THE PROCEDURE ON ALL DEFINED GROUPS. IN THIS CASE 

there are only 2 groups-/ i 

PROC PRINT DATA-POCEST /-PRINTS TKC DATA SET P02EST INTO A CHS FILE-/J 



FILE: LIFETEST CODE Al 



TITLE’PAYGRADE E-* BY RATE GROUP * /-SAME AS PREVIOUS CODE EXCEPT 
IT DIVIDES THE PAYGRADE INTO S GROLPS t I FOR EACH MOS GROUP. GROUP 1 
IS THE AVIATION PEOPLE . 2-ADMIN . 3 • TECHNI CAL . 4 « SK ILLED . S *CG PRO ■/ I 

Data P09» 

INPUT RATE RANK SEX AGE MAR RACE MONTHS CEN| 

IF RATE = SiOlRATE = S70lRATE-S20IRATE«S30 IRATE*SS0 THEN GP-H 

ELSE IF RATE: 540tRATE«A2Q THEN GP*2i 

ELSE IF RATE«?TQIRATE»18Q l RATE - IA0IRATE-270 I RATE -240 I RATE - 280 THEN GP « 5 I 
ELSE IF RAlE^SOO IRATE'8A0 IRATE-B70 THEN GP-4: 

ELSE GP-S: •' 

CARDS; / 

i 

PROC LIFETEST PLOT-IS) MAXTIHE-300 OUTSERV-POJESTl 
TIME HONTHS«CEN( I )» 

STRATA GP; 

PROC PRINT DATA-POTESTl » 
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INTEGER I.M.N.P 



PROGRAM HOOF I T 2 

C THIS PROGRAM GENERATES MODELLED PRODUCT LIMIT ESTIMATES FOR 

C PAfGRAOES WHICH HAVE BEEN PARTITIONED BV 5 MOS GPOUPSj AVIATION. 

C ADMINISTRATIVE, TECHNICAL, SKILLED. CG PROFESSIONAL. (T USES AS 

C INPUT REGRESSION COEFFICIENTS t BETA I THROUGH BETA] 3 ), SHAPE 

c PARAMETERS CPI AND PZ ) , SCALE PARAMETERS CLAHBDAI AND LAMBDAZ), 

c AND A TIME PARAMETER (T). THIS PROGRAM 1$ ALSO DESIGNED TO WORK 
C ON DATA THAT FIT A 3 PIECE WEI BULL. VARIABLES BETA] THRU BETA*. 

C PI, AND LAMBDA I ARE VALUES FOR THE FIRST WEIBULL PIECE CP IS TT« 

C TIME VALUE OF kHERE THE FIRST BREAK OCCURS). VARIABLES BETAS THRU * 
C BETAS • P2 « and LAMBDA? (AND M) CORRESPOND TO THE SECOND PIECE. 

C VARIABLES BETAS T^U BETAI2. PS. LAMBDAS CAND N) CORRESPOND TO 

C LAST WEIBULL PIECE. VARIABLE BETA! 3 EOUALS ZERO AMD CORRESPONDS 
C TO THE CG PROFESSIONAL COEFFICIENT. THE VARIABLE SHAT IS THE 

C MODELLED PRODUCT LIMIT ESTIMATE: SHAT I FOR THE AVIATION MOS 

C GROUP. SHATT FOR ADMIN GROUP, SHAT3 FOR TECHNICAL GROUP. SHAT4 FOR 

C THE SKILLED GROUP. AND SKATS FOR THE CG PRO CROUP. SHAT VALUES AR 

C CALCINATED FOR TIME PERIODST. CT-I N). WHERE M ]A THE LAST 

C TIME VALUE NEEDED FOR THE PAYGRADE. 

C THE ESTIMATES ARE WRITTEN ON THE CMS FILESl FT88F00I . FT19F00I. 

C AND FT90F00I . 



REAL LAMBCAI .PI .BETAI .BETA?, BETAS. BETA4.BETA5.BETAI3 
REAL LAMBDA? •PZ,BE t A|,BE t A7,BETA8,BETAT.BETAI0.BETAII.J 
REAL SHAT I . SHAT? , SHA T3 « SHAT 4 , SHAT5 .LAMBDAS , P 3 

, C FOLLOWING ESTABLISHES all VARIABLE VALUES. 

PARAMETER CM* 1 0 .LAMBDA I ■ . 1 007 .PI -2.485) 

PARAMETER l BETA I -0.0. BET A2— .1*38 . BETAS*-. 08 33, BET AC 3 

PARAMETER CN*200 .LAMPDAC.0. 00003 .PZ-4 . 43S ) 

PAR AMETERC BETA5 0SC5. BETA4-.1 M .BETA?— .0515 .BETA8— 4 
PARAMETER CP -420 .LAMBDAS-, 0028 .P3*8.0334 ) 

PARAMETER CBETA9-.0254.BETAI0-.03O?,BETA]I-.I04,BETaI?-- 
PARAMETERCBETAI3-0.0 ) 

C CALCINATES SHATS FOR FIRST WEIBULL PIECE. 

DO 10 1-1. M 

, SHATI-EXPC-C CLAHBDAI -I -EXP C- BETAI ))--PD) 

' SHAT? *EXP C- C C LAMB DA I • I -EXP (-BETA? ) )- -PI ) ) 

SHAT3»EXP(-(CLAMBDAI-I-EXP(-BETA3))--P1 )) 

SHAT4-EXP(-( CLAMBD* I • I -EXP C-BETA4 ))--PI ) ) 

SHAT5-EXPC- C (LAMBDA! • I-EXPC-BETAI 3))--PI ) ) 

WRITE (88. 9 1 ) I • SHAT I , SHAT 2 , SHAT 3 .SHAT4 .SHAT 5 
10 CONTINUE 

C CALCULATES SHATS FOR SECOND WEIBULL PIECE. 

DO 20 I *M * j , N 

Shat I* EXP (- ( (Lambda:- I -EXP C-BETA5))--P2)) 

SKAT2-EX0i-( f lambda:- 1 -EXP (-BETA4) )--P2 ) ) 

shat3»expc- c clambda:-i-expc-beta?))--p:) j 

SHAT 4* EXP (- ( (LAMBDA?- I -EXP (-BETAS ) )--P? ) ) 

SHATS »EXP(- ( CL*mBCa:-I-EXP(-BETAI3))--P:)) 

WRITECS9.9I )I .SHAT I . SHAT2 . SHAT 3 . SHAT 4 , SHATS 
20 CONTINUE 

C CALCINATES SHATS FOR LAST PIECE. 

DO 30 I-NM.P 

SHA7I-EXPC- ((LAMBDAS* I -EXP (-BETA9))- -PS)) 

SMAT2*EXP(-( (LAMBDAS- I -EXP(-BETAIO))- -PS)) 

SHATS -EXP (-( (LAHBDA3-I-EXP(-BETAII))-.P3)) 

SHAT4 -EXP (-((LAMBDAS- I -EXP (-BETAI2 ) )--P3 )) 

SHATS *EXP(-( (LAMBDA!- I -EXP (-BETAI 3 ))--P3)) 

WRITE (90. 9 I ) I. SHAT I, SHAT?. SHATS. SHAT4, SHATS 
SO CONTINUE 

91 FORMAT(IX,IS.3X.F4.4,3X,F4.4.3X,F4.4.3X.F4 .4.3X.F4.4) 
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APPENDIX D. PROCEDURES FOR CONDUCTING REGRESSION ANALYSIS 



PROGRAM RECPREP 

C THIS CODE SEPARATES a PAYCRADE data set by mos group and CREATES 
C DATA SETS NEEOED TO CONDUCT THE SAS L1FEREG PROCEDURE. AGAIN THIS 

C CODE IS DESIGNED FOR A 3 PIECE ME1BLAL. THE INPUT FILE (*2) 

C CONTAINS THE PAVGRADE DATA SET. 3 OUTPUT DATA SETS ARE CREATED TO 

C CONFORM TO THE 3 PIECE WEIBW-t. THE VARIABLE N IS THE NUMBER OF 

C FILES IN THE PAYCRADE. 

integer l.N. RATE. MONTHS. CEN 
parameter (N* 448 ) 

OPEN(2.F1LE-*P08 data a*) 

DO 10 1 • 1 . N 

READ ( 2.91) RATE. HONTHS. CEN 
C THIS IS THE FIRST PIECE OF THE WEIB’JLL 
IF ( MONTHS .LE. 10) THEN 

IF ( RATE .EQ. 520 .OR. RATE .EO. 540 .OR. RATE .E0.570 .OR. 

1 RATE .EO. 530 .OR. RATE .EO. 5501THEN 

WR 1 TE t 2 4. 9 3 ) MONTHS. CEN 
ELSEIFfRATE .EO. 340 .OR. RATE .EO. 420) THEN 
NRITE<24.*4) MONTHS- CEN 

ELSE1F<RATE .EO. 270 .OR. RATE .EO. 240 .OR. RATE -EO. 180 
1 .OR. RATE .EO. 7*0 .OR. RATE .EO. 14® .OR. RATE .EO. 

1 2f 0 ) Then 

WPITE(24.95) MONTHS. CEN 

ELSEIF {RATE .EO. 870 .OR. RATE .EO. 500 .OR. RATE .EO. 840)THEN 
WR1TE(2«.U) MONTHS. CEN 
ELSE 

WR1TE(24.97) MONTHS. CEN 
EN01F 

C THIS IS THE SECOND PIECE OF THE WE19XL. 

ELSE1F1HCNTMS .GT. 10 .AND. MONTHS .LE. 200)THEN 

IF (RATE .EO. 520 .OR. RATE .EO. 540 .OR. RATE .EQ.570 .OR 
1 RATE -EO. 530 .OR. RATE .EO. 550 ) THEN 

WR1TE<27.*3) MONTHS. CEN 
EL$E1F(RATE .EO. 340 .OR. RATE .EO. 420) THEN 
WRlTE<27.f4) MONTHS. CEN 

ELSE1F(RATE -EO. 270 .OR. RATE .EO. 240 .OR. RATE .EO. 180 
1 .OR. RATE .EO. 7*0 .OR. RATE .EO. \4p .OR. RATE .EO. 

1 280) THEN 

WR I TE (2 7 . f 5 ) MONTHS. CEN 

ELSEIF ( RATE .EO. 870 .OR. RATE .EO. 500 .OR. RATE -EO. B40)T*«N 
NR ITE ( 27 • *4 ) MONTHS. CEN 
ELSE 

NR 1 TE ( 27 . f 7 ) MONTHS. CEN 
ENOIF 



c THIS IS THE LAST PIECE OF THE ME 1 BULL . 

asE 

1F<RATE .EO. 520 .OR. RATE .EO. 540 .OR. RATE .EQ 570 .OR. 

I RATE .EO. 530 .OR. RATE .EO. 550) THEN 

WRITE(28.*3) HONTHS. CEN 
ELSE1F(RATE .EO. 340 .OR. RATE .EO. 420) THEN 
HR 1 TE ( 28 • 94 ) MONTHS. CEN 

ELSE IF < RATE .EO. 270 . CR . RATE .EO. 240 .OR. RATE .EO. 180 
1 .OR. RATE .EO. 7*0 .OR. RATE .EO. 140 .OR. RATE .EO. 

I 280) THEN 

MRITE(28 ■ *5 ) MONTHS. CEN 

CLSEIF(RATE .EO. 870 .OR. RATE .EO. 500 .OP. RATE .EO. 840 )THEN 
MR ITE (28 . *4 ) MONTHS. CEN 
ELSE 

WR 1 TE ( 28 « * 7 ) MONTHS. CEN 
END1F 
END1F 
10 CONTINUE 

C THIS IS THE FORMAT FOP THE READ/l^UT STATEMENTS. 



*2 FORMAT (1X.13.13X. 13. IX. II ) 

c THESE FOR HAT STATEMENTS CREATE REGRESSION DATA SETS. HONTHS IS T»« 

C DEPENDENT VARIABLE AND IS LOCATED IN THE FIRST COLUMN. THE 

C RRECEO.NG COLUMNS CONTAIN A 0 OR I IF THE PERSON FALLS INTO ONE 

C OTF THE HOS GROUPS. I THE STM MOS GROUP IS ASSUME 0 AND DOES MOT SHO 
c IN THE DATA SET. 

*3 FORMAT ( IX. 13. IX. * 1* . IK.’O* .IX.’ O’ .lX.'O* .IX. 11) 

*4 FORMAT! IX. 13. IX. *0* » I X » ' 1 ' • I X • * 0 * • I X » * 0 ‘ # 1 X « 1 1 ) 

*5 FORMAT < IX. 13 .IX. 'O’. IX. *0’. IX. T. IX. '0'. IX. ID 
94 FORMAT ( IX. 13 . IX. ' O' . IX . *0* . IX. *0' . IX . * 1* . IX. II ) 

97 FOR HAT ( IX, 13. IX. ’0’ .IX. *0'. IX. *0*. IX. *0*. IX. II) 

END 
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MLE: L1FEREG CODE 



A1 



TITLE ‘REGRESS !OW OUTPUT PAVGRADC E-<* /»SAS CODE PERFORMS A PEORESSIOW 
OW CENSORED DATA. GENERATES REGRESSION COEFFICIENTS. SHAPE AND SCALE 
PARAMETERS. OUTPUT IS VERV SMALL. •/ I 

data po«j 

INPUT MONTHS AVIATION AOMIN TECH SKILLED CENi 
CARDSl 

I 

PROC LIFEREC OUTEST « MODEL I 

CLASS AVIATION ADMIN TECH SKILLED /-SETS THE VARIABLES AVIATION. ADMIN. 
TECH. AND SKILLED AS CATEGORICAL (0 OR I) VARIABLES •/ I 

MODEL MONTHS *CEN( 1 )• AVIATION ADMIN TECH SK ILLED/D 1 ST» WE ! BULL /-SETS THE 
DEPENDENT VARIABLE EOUAL TO MONTH, SETS THE CENSORING VAR I ABLE .CEN« I . 
SETS THE INDEPENDENT VARIABLES EQUAL TO AVIATION. ADMIN. TECH. SKILLED. 
ESTABLISHES THE WE 1 BULL AS THE PARAMETRIC DISTRIBUTION •/ » 
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